대규모 병렬 프로세서 프로그래밍: 실습 중심 접근법: 일반 목적 GPU 아키텍처로의 진화적 전환

다음과 같은 전환은 NVIDIA GT200 에서 페르미 아키텍처 는 세 번째 세대의 GPU 컴퓨팅의 탄생을 의미합니다. 이전 아키텍처들은 수학용으로 개조된 그래픽스 우선 단위였지만, 페르미는 처음부터 GPGPU(일반 목적 GPU) 응용 프로그램을 위해 설계되었습니다.

1. 그래픽스 중심에서 컴퓨팅 중심으로

GT200는 텍스처 유닛과 고정된 데이터 병렬 처리에 집중했지만, 페르미는 통합 메모리 요청 경로를 도입했습니다. 이러한 변화는 계산적 사고개발자들이 단순한 2차원 격자 매핑을 넘어서 복잡한 C++ 알고리즘으로 나아갈 수 있도록 해줍니다.

2. 메모리 계층 구조의 획기적 발전

페르미는 진정한 L1/L2 캐시 계층 구조 와 함께 IEEE 754-2008 부동 소수점 표준을 준수했습니다. 이를 통해 연구자들은 모든 바이트에 대해 '스크래치패드' 메모리(공유 메모리)를 수동으로 관리할 필요가 없어졌으며, 비정형 데이터 구조와 과학적 공학에 적합한 이중 정밀도 정확도를 가능하게 했습니다.

TERMINALbash — 80x24

> Ready. Click "Run" to execute.

QUESTION 1

Which architecture is considered the true start of the 'Third Generation' of GPU computing?

GT200 (Tesla)

Fermi

G80

Fixed-function Pipeline

QUESTION 2

What memory feature was introduced in Fermi to help handle irregular data patterns?

Manual Scratchpad only

Hardware-managed L1/L2 Cache Hierarchy

Write-only Texture Buffers

Disabling Global Memory

QUESTION 3

Fermi's compliance with IEEE 754-2008 was critical for which application type?

Simple 2D Sprite Rendering

High-precision Scientific Computing (FP64)

Text Scrolling

Basic Vertex Shading

QUESTION 4

What does 'Computational Thinking' refer to in the context of the Fermi shift?

Treating the GPU as a fixed-function signal processor.

Focusing on the physics of the problem rather than manual data movement.

Manually coding assembly for every pixel.

Using only 2D textures for storage.

QUESTION 5

How did Fermi improve thread management?

It removed the concept of Warps.

It introduced sophisticated hardware thread scheduling.

It limited threads to only 32 per GPU.

It forced all threads to run the same instruction forever.